首页> 外文OA文献 >Simple methods for improving speaker-similarity of HMM-based speech synthesis
【2h】

Simple methods for improving speaker-similarity of HMM-based speech synthesis

机译:用于改善基于Hmm的语音合成的说话者相似性的简单方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this paper we revisit some basic configuration choices of HMM based speech synthesis, such as waveform sampling rate, auditory frequency warping scale and the logarithmic scaling of F0, with the aim of improving speaker similarity which is an acknowledged weakness of current HMM-based speech synthesisers. All of the techniques investigated are simple but, as we demonstrate using perceptual tests, can make substantial differences to the quality of the synthetic speech. Contrary to common practice in automatic speech recognition, higher waveform sampling rates can offer enhanced feature extraction and improved speaker similarity for speech synthesis. In addition, a generalized logarithmic transform of F0 results in larger intra-utterance variance of F0 trajectories and hence more dynamic and natural-sounding prosody.
机译:在本文中,我们将重新探讨基于HMM的语音合成的一些基本配置选择,例如波形采样率,听觉频率翘曲标度和F0的对数缩放,目的是提高说话者的相似性,这是当前基于HMM的语音的公认弱点合成器。所有研究的技术都很简单,但是,正如我们使用知觉测试所证明的那样,可以对合成语音的质量产生实质性的影响。与自动语音识别中的常规做法相反,较高的波形采样率可以提供增强的特征提取功能和语音合成中说话人的相似性。此外,F0的广义对数变换会导致F0轨迹的话语内方差更大,从而使韵律更加动感自然。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号